Question Terminology And Representation For Question Type Classification
نویسنده
چکیده
Question terminology is a set of terms which appear in keywords, idioms and fixed expressions commonly observed in questions. This paper investigates ways to automatically extract question terminology from a corpus of questions and represent them for the purpose of classifying by question type. Our key interest is to see whether or not semantic features can enhance the representation of strongly lexical nature of question sentences. We compare two feature sets: one with lexical features only, and another with a mixture of lexical and semantic features. For evaluation, we measure the classification accuracy made by two machine learning algorithms, C5.0 and PEBLS, by using a procedure called domain cross-validation, which effectively measures the domain transferability of features.
منابع مشابه
دستهبندی پرسشها با استفاده از ترکیب دستهبندها
Question answering systems are produced and developed to provide exact answers to the question posted in natural language. One of the most important parts of question answering systems is question classification. The purpose of question classification is predicting the kind of answer needed for the question in natural language. The literature works can be categorized as rule-based and learning...
متن کاملارایه یک پیکره پرسش و پاسخ مذهبی در زبان فارسی
Question answering system is a field in natural language processing and information retrieval noticed by researchers in these decades. Due to a growing interest in this field of research, the need to have appropriate data sources is perceived. Most researches about developing question answering corpus area have been done in English so far, but in other languages as Persian, the lack of these co...
متن کاملPolitical Terms by APLL: Issues of Terminology Implantation and Acceptability
The present study investigates the implantation of political science terminology approved by the Academy of Persian Language and Literature (APLL) in the Hamshahri corpus made up of news text from Hamshahri newspaper and their acceptability among MA students of English translation studies (ETS), English literature (EL), and Political science (PS). To conduct this research the frequencies of the...
متن کاملIranian Women, Inside or Outside of the Stadium? An Anthropological Study on Female Representation of National Identity in Iran
A controversial and comprehensive debate that has resulted in numerous discursive clashes in Iran pertains to the presence of women at stadiums during male soccer matches. Different discourse systems have expressed their own contradictory and opposite stances in terms of whether Iranian women have the right to attend such events inside or outside the stadium, ranging from different notions of r...
متن کاملExploiting Paraphrases in a Question Answering System
We present a Question Answering system for technical domains which makes an intelligent use of paraphrases to increase the likelihood of finding the answer to the user’s question. The system implements a simple and efficient logic representation of questions and answers that maps paraphrases to the same underlying semantic representation. Further, paraphrases of technical terminology are dealt ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002